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Abstract. - We propose a fluctuation analysis to quantify spatial correlations in complex net- 
works. The approach considers the sequences of degrees along shortest paths in the networks and 
quantifies the fluctuations in analogy to time series. In this work, the Barabasi- Albert (BA) model, 
the Cayley tree at the percolation transition, a fractal network model, and examples of real-world 
networks are studied. While the fluctuation functions for the BA model show exponential decay, 
in the case of the Cayley tree and the fractal network model the fluctuation functions display 
a power-law behavior. The fractal network model comprises long-range anti-correlations. The 
results suggest that the fluctuation exponent provides complementary information to the fractal 
dimension. 
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Introduction. — Networks, consistmg of simple ele- 
ifaents, its nodes and links, can display complex properties, 
such as a broad degree distribution, clustering, modular- 
ity, and many others. This work focuses on the correlation 
between node degrees - the number of links attached to a 
riode. Degree correlations, measuring the likelihood that 
nodes of a given degree arc connected, help to explain im- 
portant features of complex networks that arc beyond the 
degree distribution or clustering. For instance, it has been 
fbund that many networks, such as the coauthorship, film 
actor (IMDb ~ Internet Movie Database), and company 
directors networks display assortative mixing between de- 
grees, indicating that nodes of like degree tend to be con- 
ilected. On the other hand, the Internet (autonomous sys- 
tem), the WWW, and some biological networks, exhibit 
disassortative mixing, where there is a high tendency for 
high degree nodes (hubs) to be connected to low degree 
nodes [Hig. 

Different ways to quantify degree correlations are de- 
fined and used in the literature. One measure is the Pear- 
son correlation coefficient of all linked pairs of nodes [1] 
which derives from the probability distribution p(fci,fc2) 
that two nodes of degree fci and fc2 are connected through 
a link [2112 • Another measure of degree correlations is the 
average degree of neighbors of a degree-fc node [3]. This 



measure may also be obtained as a particular case of the 
matrix p(ki,k2), and classifies networks into assortative 
(disassortative) when this quantity increases (decreases) 
with k. Furthermore, it has been shown that disassorta- 
tivity reflected in p(fci, ^2) is a tightly related property to 
fractality in complex networks [SJE]. 

The different measures of degree correlations consider 
only correlations between nearest neighbor nodes, i.e. only 
correlations between the degrees of nodes at distance 1, 
but not further. Because of this limitation, these measures 
fail to capture much of the rich topological information of 
the network. In this work we introduce an approach that 
extends the idea of degree correlations to larger distances. 
An immediate extension of previous methods could con- 
sist on simply considering the correlation coefficient from 
nearest neighbors to the second, the third, . . . d-th neigh- 
bors. However, in complex networks the direct calculation 
of such a function is not feasible. Here we introduce a fluc- 
tuation analysis that overcomes this issue. We study the 
fluctuations of the degree along shortest paths between 
two nodes and consider the distance between nodes analo- 
gous to time in time-series analysis. Our approach can be 
adapted to study many topological and dynamical corre- 
lation properties in networks, such as node activity, node 
weights, time of node addition, etc. In this work we in- 
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Fig. 1: Examples of degree sequences along shortest paths. For 
arbitrary shortest paths we plot the degree of the nodes versus 
the distance for (a) the fractal network model with e = 0.24 
[at ~ —0.65) and (b) the pin yeast network (offc ~ —0.53). 



troduce the fornialism and focus on degree correlations, 
leaving other properties for further investigation. 

Fluctuation analysis. — In order to quantify long- 
range degree (anti-) correlations at distances larger than 
nearest neighbors, we propose a fluctuation analysis. The 
method consists of the following steps: 

1. Find the shortest path between all pairs of nodes in 
the network (if the shortest path between a pair of 
nodes is not unique, then we consider an arbitrary 
one), see Fig. [TJ 

2. Consider all shortest paths of length d and calculate 
the average Kd = {h) of the sequence {ki) of node 
degrees along each of these paths of length d, where 
1 < Z < (d+1). 

3. Repeat the previous step for all possible values of d 
in the network. 

4. Calculate the fluctuation function F{d) = a{Kd\d), 
which is the conditional standard deviation of the Kd 
at distance d, i.e. the standard deviation of the aver- 
ages Kd for those paths of length d. 

The fluctuation function describes the correlations of 
node degrees along shortest paths [THIO] . When an uncor- 
related time series is partitioned into segments of size s, 
the standard deviation of the segments averages decays as 

Because the overall distance between nodes in a network 
is short (compared to the time span of time series) , we do 
not partition the degree sequences into segments. Instead, 
we consider all shortest path of any length d. Thus, if 
the covariance, C{d), between the degree of nodes at dis- 
tance d scales as C{d) ~ {{ki — {k)){kj — {k))\d) ^ d~'' , 
where (fc) is the average degree of the network, then for 
the fluctuation function we expect 



F{d) - d""^ 



(1) 



where ak = —7/2. Notice that because the average (cal- 
culated in step H]) involves a division by d, the degree fluc- 
tuation exponent ak differs by 1 from the usual Hurst-like 



exponent a [llj of time series analysis: ak = a — 1. For 
asymptotical ak = —1/2 {a = 1/2) the degrees are uncor- 
related. 

Starting from the average degree along a shortest path, 
^d ~ d+i 12i=i ^ji ^"^ '^^^^ express the variance as follows 
E]: 



a'{Kd\d) = {{Kd-{Kd)f) 



(2) 



where {Kd) is the average over all values of Kd- Therefore, 
assuming {Kd) — {k), we flnd 
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In the case of (positive) long-range correlations with cor- 
relation exponent 7, the second term dominates. In the 
limit of large d we can integrate the sum, leading to the 
approximation: 



a^{Kd\d) 

(T{Kd\d) 



(d+l)2 
d-^/2 ^ 



d^-^ 



(3) 



With Eq. dl]) and F{d) = a{Kd\d) we obtain 

ak - -7/2 . (4) 

While positive long-range correlations are characterized 
by fluctuation exponents —1/2 < a^ < [TT], negative 
long-range correlations (long-range anti-correlations) are 
characterized by fluctuation exponents —1 < a^ < —1/2 
[T^HH]- For the former, the covariance scales as C{d) ^ 
d~^ and for the latter we assume C{d) ~ ~{d^^). 

Results. 

Barabasi- Albert model. First we investigate the 
Barabasi-Albert (BA) model which is based on prefer- 
ential attachment and has been introduced to generate 
scale-free networks |15| with power-law degree distribu- 
tion p{k) ^ k^^. The BA model consists of subsequently 
adding nodes to the network and linking them to m al- 
ready existing nodes which are randomly chosen with a 
probability proportional to their degree. In this model, 
the parameter m denotes the number of links attached to 
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Fig. 2: Degree fluctuation functions for the BA model. The 
fluctuation functions, F{d), are plotted against the length 
of the shortest paths d, for (a) m = 3, (b) m — 2, and 
(c) m = 1. The panels (a-c) show F{d) for networks consist- 
ing of 500, 1000, 2000, 4000, and 8000 nodes (from bottom to 
top). Panel (d) shows for m = 2 the slopes of exponential fits 
applied to F{d) as a function of the network sizes. The insets 
in (c) and (d) are the same as the corresponding panels but 
in double-logarithmic representation [inset (c) only A^ — 8000] . 
For each set 100 configurations have been averaged. 



a new node. Therefore, the larger the to, the more cy- 
cles are generated in the network. In addition, when m 
is large the overall diameter of the network - the longest 
shortest path between all pairs of nodes - decreases due to 
the increased number of paths between any pair of nodes. 

In order to analyze the correlations in BA model net- 
works we generate 100 configurations, apply the fluctua- 
tion analysis, and calculate the degree fluctuation func- 
tion F{d) among all configurations. Figure [5] shows the 
obtained F(d) for to ~ 1,2,3 and different network sizes 
N = 500, 1000, . . . , 8000 nodes in semi-logarithmic scale. 
The fluctuation function approximately decays exponen- 
tially according to F{d) ^ Q-d./d.y. ^ jj^ ^j^g Qa,se of m = 2 
(Fig.[2]3) straight lines between d = 2 and d~% are found, 
almost parallel for the different sizes. 

The dependence on the network size is shown in Fig. [5Ji 
where for m = 2 the slope 1/dx is plotted versus the 
number of nodes A'^. In this case, the slope varies between 
l/dx ~ 0.16 for the small networks and l/dx ~ 0.13 for 
the larger ones. Further investigations are needed to clar- 
ify if the slope reaches an asymptotic value. The BA model 
has been shown to be uncorrelated [Tl|3] and from an anal- 
ogy with times series analysis, one would expect that the 
fluctuation function decreases according to Eq. ([ij with 
ak = —1/2 {a = 1/2). However, the short distances in 
the BA model and the exponential decay indicate a finite- 
size effect p_6j. 

Cayley tree. Since many complex networks are small- 
world, their diameter is rather small and therefore the 




Fig. 3: Degree fiuctuation function of the z — 3 Cayley tree 
at percolation transition. The fluctuation function, F{d), is 
plotted against the length of the shortest paths d. The dot- 
ted maroon lines represent the quantiles enclosing 90% of the 
too configurations (each). The dashed straight line is a guide 
to the eye and corresponds to the exponent ak = —1/2. 



largest distance d for which the fluctuation function can 
be calculated is also small (see Fig. ^,h) . To overcome 
this issue we first study a tree structure, the Cayley tree 
at the percolation transition. The fractal nature of this 
system allows us to analyze the fluctuation function at 
longer ranges of d. The Cayley tree [T71[TH] is built by 
starting from a central node and attaching z new nodes to 
it. In the next generation z — 1 additional nodes are linked 
to each of the new nodes of the previous generation. All 
nodes except the ending ones have degree z. In percola- 
tion, starting from the complete Cayley tree, elements of 
the tree are randomly removed. In this case, the perco- 
lation transition is at pc = jzj [HI and the topological 
dimension of the giant component is df = 2 [17j . 

Figure [3] depicts the fluctuation functions F{d) for the 
Cayley tree with z = 3 in generation n = 150 at the per- 
colation transition (together with the quantiles enclosing 
90% of the configurations). In this case all node degrees 
are within the range 1 < fc < 3. For 2 < d < 80 the fiuc- 
tuation function follows a power-law according to Eq. ([T} 
with ak — —1/2 (a ~ 1/2), indicating uncorrelated de- 
grees of the nodes along the paths. The deviations from 
the power-law at large distances are due to the finite size 
of the networks. Since the removal of elements of the Cay- 
ley tree happens randomly and in particular independent 
of each other. Fig. [3] is in agreement with what we ex- 
pect {ak — —1/2 corresponds to uncorrelated degrees), 
supporting the proposed technique. 

Fractal network model. We now use the fractal net- 
work model introduced by Song, Havlin, and Makse [S] 
in which the diameter grows with the networks size as 
N^^'^'. It has been shown that anti-correlations between 
node degrees are a determinant factor for the fractality of 
a network [Sl fT^H^ . 

The fractal network is built iteratively (an illustration 
can be found in Ref. [H]). It starts in generation n = 
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with 2 nodes connected by 1 link. In generation n + 
1, m new nodes are attached to the endpoints of each 
Hnk of the previous generation, (i) With probability 1 — 
e the corresponding link of generation n is removed and 
X < m new links arc added between the new nodes, (ii) 
With probability e the corresponding link remains and 
X — 1 new links connect the new nodes. We use m = 2, 
X — 2, and n = 3,4,5. In the case of e = 0, in the first 
generation the network represents a hexagon consisting 
of 6 nodes and 6 links. In the second generation, each 
link of the hexagon is replaced by another hexagon, and 
so on. The larger e, the more short-cuts are produced 
and the more small-world-like the generated network is. 
Thus, according to the parameter e, the network is fractal 
(e = 0) comprising large diameter or non-fractal (e = 
1) being small-world. On the other hand, although m 
and X do not alter whether the network is fractal or not, 
they are relevant for the number of nodes in the network, 
for node degrees, and for the number of possible paths 
between pairs of nodes [5]. For example, a large value of 
m generates a faster increase in the number of nodes, and a 
larger x generates more cycles and paths between any pair 
of nodes. All three parameters, m, x, e affect the fractal 
dimension, d{ , of the network which is given by [511^ 



ln(2r?T, + x) 
ln(3 - 2e) 



(5) 



The expression diverges when e = 1 for a pure small- world 
network. For ?7i = 2, x = 2 it is df = ]„i^_lf\ ■ The degree 



distribution is given by p{k) 



ln(3-2e) ' 



To analyze this model we generate many configurations 
and apply the fluctuation analysis. Figure S] shows the 
degree fiuctuation functions for the model in generation 
n = 4 and different values of e. For e = (fractal) the fluc- 
tuation function decreases as F{d) ^ d^^ , thus a^ = —1, 
whereas F{d) exhibits some structure which is due to the 
deterministic character of the model when e = 0, i.e. for 
certain distances the sequences of degrees are identical 
leading to vanishing standard deviation among the paths. 
In the case of e = 0.06, the fluctuation function also de- 
creases as a power-law but with an exponent at ~ —0.7. 
Due to finite size effects, F{d) exhibits deviations from a 
power-law for d > 60. For e = 0.24, Fig. |3J;, although 
the network diameter decreases compared to smaller val- 
ues of e, we still find a power-law regime over more than 
1 decade. The inset of Fig. |4]; shows the fluctuation func- 
tions for n — 3 and n — 5. These power-law decays 
support the conjecture of long-range correlations. Since 
a ~ ak + I < 1/2, the degrees are anti-corrclated, con- 
sistent with previous findings. The case e = 1, where the 
networks is small- world, is depicted in Fig. |4ji. The diam- 
eter is much smaller and the decay is close to exponential. 

Next, to obtain the exponents ak we apply least squares 
fits to F{d). In Fig. [5] the exponents a = ak + I are 
plotted against the model parameter e for the networks in 
generations n = 3, 4, 5. We find that the curves of a vs. e 




Fig. 4: Degree fluctuation functions for the fractal model in the 
fourth generation (n = 4). The fluctuation functions, F{d), are 
plotted against the length of the shortest paths d, for (a) e — 
0, (b) e = 0.06, (c) e = 0.24, and (d) e = 1. The dotted 
maroon lines in (b) and (d) represent the quantiles enclosing 
90% of the 250 configurations (each). The dashed straight 
lines are guides to the eye and correspond to the exponents 
(a) Ofc = —1, (b) Qfc = —0.7, and (c) a^ = —0.65. The cases 
e = and e = 1 are deterministic and accordingly there are no 
uncertainty levels. Panel (c) contains an inset with the F{d) 
for the model with e = 0.24 in third (n = 3) and fifth (n = 5, 
25 configurations) generation. The inset in panel (d) is the 
same as the major panel but in linear-log representation. 




Fig. 5: Exponents of the fractal network model. The ex- 
ponents a — Uk + 1 are plotted as a function of the model 
parameter e. The ak-values were obtained from least squares 
fits to the fluctuation functions as exemplified in Fig. |4l We 
use the model in third, fourth, and flfth generation (n — 3, 
n = 4: 250 configurations; n — 5: 25 configurations). The 
green straight solid line represents a power-law fit to the expo- 
nents for n = 5 leading to a^ -f 1 = a ~ e*^ with e ~ 0.2. 
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are parallel for all three values of n. However, the actual 
value of Qffc systematically decreases with the generation n 
due to finite-size effects. Although the range of a is rather 
small, we exclude a logarithmic dependence of a on e since 
for e = we find a = 0. Applying the regression a ~ e*^ 
to the n ~ 5 exponents we find e w 0.2. 

Real-world networks. The fluctuation analysis serves 
as a tool to unravel the long-range degree correlations 
present in real- world networks. In this section we study 
the long-range degree correlations in 4 biological networks, 
(i) the Protein Interaction Network of Yeast [13], with 
777 nodes and 1797 links in which two proteins of the 
Yeast cell are connected if they interact chemically with 
each other, (ii) the Metabolic Network of E. coli [23], with 
2859 nodes and 6890 links, in which metabolites within 
the E. coli cell interact biochemically with each other, 
(iii) the Human Homology Network [5S], with 21710 nodes 
and 1289345 links, where proteins of the human cell are 
connected through a link if they are biologically homol- 
ogous (their sequence of amino-acids present an E-value 
smaller than 10^'^") and (iv) the Homology Network of 
251 Prokaryotic Genomes [55], with 30727 nodes and 
1206654 links, in which proteins of different organism are 
connected through a link if they are biologically homolo- 
gous. These 4 networks have been found to exhibit frac- 
tality [5lli|19l|20|. 

Figures 12^, b show the fluctuation functions obtained for 
these real-world networks. The curves exhibit power-law 
decays over a wide range of d. For large scales deviations 
are found that are due to the finite size of the networks, 
similarly to what is seen for the fractal network model. 
We obtain the exponents ak — —0.53 (pin yeast), ak — 
—0.74 (human homology), at — —0.83 (homology), and 
ak — —0.88 (metabolic). In Fig. [BJ: the corresponding 
a = afe -1- 1 are compared with the fractal dimension values 
that have been found before [SUnHHlUS]. The panel also 
contains the results for the Cayley tree and the fit for 
the fractal model {n = 5) from Fig. |S] when the values df 
are calculated according to Eq. (O and with the model 
parameters {m = 2, a; = 2, variable e). 

The idea of Fig. [SJ: is to compare the fractal dimension 
with the fluctuation exponent. Although our results for 
the real- world networks are scattered in Fig. [BJ;, they dis- 
play a decrease in a as d[ increases. The Cayley tree seems 
to follow the same trend. In contrast, the simulations 
in the fractal model reflect the opposite. This suggests 
that d{ might not be the only quantity related to degree 
anti-correlations and that the fluctuation exponent pro- 
vides complementary information. Further studies, which 
are beyond the scope of this work, are necessary to com- 
pletely determine the variables affecting the value of a. 

Discussion and outlook. — In summary, a method 
for the characterization of spatial degree correlations in 
complex networks is suggested. The technique is based on 
fluctuation analysis in analogy to methods used in times 
series analysis. The fluctuation analysis is applied to the 



degrees of (i) the BA model, (ii) the Cayley tree at perco- 
lation, (iii) the fractal network model, and (iv) examples 
of real-world networks. It is found that fluctuation func- 
tions of the BA model decay exponentially. In contrast, 
the degree correlations in the fractal network model decay 
as a power-law and according to the obtained exponents 
the degrees comprise long-range anti-correlations. Further 
studies are needed to obtain a better understanding of a 
possible connection between power-law anti-correlations 
and fractality of complex networks. 

Such long-range anti-correlations may have structural 
implications regarding the stability of networks with re- 
spect to random removals j27j . Long-range correlations 
could also have an influence on correlations between the 
degree and the betweenness centrality of nodes [15] . Possi- 
ble applications include disease spreading, see e.g. [H] and 
references therein, where it is important whether large de- 
gree nodes tend to be linked to other large degree nodes 
or if such hubs arc separated by small degree nodes, and 
how correlations at larger distances influence the spread- 
ing. Other applications could be climate networks [5DH5^ . 

Nevertheless, further research is needed to evaluate the 
proposed fluctuation analysis and the possible dependence 
of the fluctuation exponents on the system size. In addi- 
tion, an analytical description of the models is aimed in 
order to gain a better comprehension of long-range cor- 
relations in complex networks. Furthermore, it would be 
interesting to vary the model parameters m and x and to 
study other fractal networks, as well as more real-world 
networks. 

Fractality and the small-world property, two seemingly 
contradicting properties are known to coexist in real- world 
networks [H] . Typically real- world networks are found to 
be fractal up to a certain length scale until the small- world 
property kicks in at global scale. Therefore, the method 
presented in this paper may also be applied to networks 
that are small- world and is not limited to fractal networks. 

Finally, we would like to point out that our approach 
can also be extended to study other spatial correlations. 
These can be (i) various network properties, as long as 
they can be attributed to the nodes or edges, such as clus- 
tering, betweenness centrality, and many others, or (ii) 
additional information available in the form of character- 
istic values assigned to the nodes (time of addition of the 
node to the network, activity of the node, etc.), or to the 
links (weight, activity, stability, etc.). In addition, one 
could think about including dynamics and analyze spatio- 
temporal correlations. 
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